Methods, systems, and media for color palette extraction for video content items

ABSTRACT

Methods, systems, and media for color palette extraction for videos are provided. In some embodiments, the method comprises: identifying, at a server, a frame of a video content item; clustering pixels of the frame of the video content item based on a color of each of the pixels into a group of clusters; for each of a plurality of clusters in the group of clusters, determining an average color for the cluster; selecting a cluster of the plurality of clusters based on the average color of the cluster; determining a color palette corresponding to the frame of the video content item for one or more user interface elements in which the video content item is to be presented based on the average color of the selected cluster; and transmitting information indicating the color palette to a user device in response to a request to present the video content item.

TECHNICAL FIELD

The disclosed subject matter relates to methods, systems, and media for color palette extraction for video content items.

BACKGROUND

People frequently watch videos on user devices, such as their mobile phones, tablet computers, etc. These videos are often streamed from a video sharing service to the user device. In some cases, a video is presented within a user interface that can include, for example, video player controls (e.g., a pause control, a rewind control, etc.) and/or information about the video. However, the colors of different items in the user interface are typically static, and may therefore clash with the video during the presentation of the video, which can make the viewing experience jarring to the user. In a more particular example, the static portion of a user interface can continue to conflict with the viewing experience as the scene and colors change when the video is being played back in the user interface.

Accordingly, it is desirable to provide methods, systems, and media for color palette extraction for video content items.

SUMMARY

Methods, systems, and media for color palette extraction for video content items are provided. In accordance with some embodiments of the disclosed subject matter, methods for color palette extraction for videos are provided, the methods comprising: identifying, using a server that includes a hardware processor, a frame of a video content item; clustering a plurality of pixels of the frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of a plurality of clusters in the group of clusters, determining an average color for the cluster; selecting a cluster of the plurality of clusters based on the average color of the cluster; determining a color palette corresponding to the frame of the video content item for one or more user interface elements in which the video content item is to be presented based on the average color of the selected cluster; and transmitting information indicating the color palette to a user device in response to a request to present the video content item.

In accordance with some embodiments of the disclosed subject matter, systems for color palette extraction for videos are provided, the systems comprising: a hardware processor that is programmed to: identify, at a server, a frame of a video content item; cluster a plurality of pixels of the frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of a plurality of clusters in the group of clusters, determine an average color for the cluster; select a cluster of the plurality of clusters based on the average color of the cluster; determine a color palette corresponding to the frame of the video content item for one or more user interface elements in which the video content item is to be presented based on the average color of the selected cluster; and transmit information indicating the color palette to a user device in response to a request to present the video content item.

In accordance with some embodiments of the disclosed subject matter, non-transitory computer-readable media containing computer executable instructions that, when executed by a processor, cause the processor to perform a method for color palette extraction for videos are provided. In some embodiments, the methods comprise: identifying, at a server, a frame of a video content item; clustering a plurality of pixels of the frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of a plurality of clusters in the group of clusters, determining an average color for the cluster; selecting a cluster of the plurality of clusters based on the average color of the cluster; determining a color palette corresponding to the frame of the video content item for one or more user interface elements in which the video content item is to be presented based on the average color of the selected cluster; and transmitting information indicating the color palette to a user device in response to a request to present the video content item.

In accordance with some embodiments of the disclosed subject matter, a system for color palette extraction for videos are provided, the systems comprising: means for identifying, at a server, a frame of a video content item; means for clustering a plurality of pixels of the frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of a plurality of clusters in the group of clusters, means for determining an average color for the cluster; means for selecting a cluster of the plurality of clusters based on the average color of the cluster; means for determining a color palette corresponding to the frame of the video content item for one or more user interface elements in which the video content item is to be presented based on the average color of the selected cluster; and means for transmitting information indicating the color palette to a user device in response to a request to present the video content item.

In some embodiments, the systems further comprise means for converting the information indicating the color palette from a first format to a second format prior to transmitting the information indicating the color palette to the user device.

In some embodiments, the systems further comprise means for determining whether to select the average color of the selected cluster as a main color corresponding to the frame of the video content item based on the average color of the selected cluster.

In some embodiments, the cluster of the group of clusters is selected based on a similarity of the average color of the cluster to a previously analyzed frame of the video content item.

In some embodiments, the systems further comprise means for determining a plurality of color palettes each corresponding to a theme of a plurality of themes to be applied to a user interface in which the video content item is to be presented.

In some embodiments, each color palette includes a first color corresponding to user interface controls in an active state and a second color corresponding to user interface controls in an idle state.

In some embodiments, the systems further comprise: means for generating a mosaic of a plurality of frames of the video content item; means for determining a main color for each frame in the plurality of frames in the mosaic; and means for generating a color palette for each frame in the plurality of frames in the mosaic based on the main color of each frame in the plurality of frames in the mosaic.

BRIEF DESCRIPTION OF THE DRAWINGS

Various objects, features, and advantages of the disclosed subject matter can be more fully appreciated with reference to the following detailed description of the disclosed subject matter when considered in connection with the following drawings, in which like reference numerals identify like elements.

FIG. 1 shows an example of a process for color palette extraction for a frame of a video in accordance with some embodiments of the disclosed subject matter.

FIGS. 2A and 2B show illustrative examples of schematic diagrams for color palette extraction for multiple frames of a video in accordance with some embodiments of the disclosed subject matter.

FIG. 3 shows a schematic diagram of an illustrative system suitable for implementation of mechanisms described herein for color palette extraction for videos in accordance with some embodiments of the disclosed subject matter.

FIG. 4 shows a detailed example of hardware that can be used in a server and/or a user device of FIG. 3 in accordance with some embodiments of the disclosed subject matter.

FIG. 5 shows an example of a user interface for presenting a video content item based on a determined color palette in accordance with some embodiments of the disclosed subject matter.

DETAILED DESCRIPTION

In accordance with various embodiments, mechanisms (which can include methods, systems, and media) for color palette extraction for video content items are provided.

In some embodiments, the mechanisms described herein can identify content color data that describes one or more colors of the content appearing in a frame of a video. For example, in some embodiments, the content color data can indicate a dominant color of the frame or a color that appears most frequently within content of the video during that frame. In some embodiments, the mechanisms can then generate a color palette based on the identified content color data. For example, in some embodiments, the color palette can indicate a color of various portions of a user interface in which the video is presented on a user device, such as a section indicating information about the video, colors of video player controls, colors of graphical overlays superimposed on presentation of the video content item, and/or any other suitable portions of the user interface. In some embodiments, the mechanisms described herein can repeat the content color data identification and color palette generation for multiple frames of the video (e.g., frames that are one second apart throughout the video, and/or any other suitable frames). The mechanisms can then transmit the color palette information for the frames to a user device that requested the video for presentation, thereby allowing the user device to use the color palette information to dynamically change a color scheme of one or more user interface elements in which the video is presented during presentation of the video.

In some embodiments, the mechanisms can determine the content color data using any suitable technique or combination of techniques. For example, in some embodiments, the mechanisms can cluster pixels within a frame based on a color of each pixel, and can identify the cluster having the most pixels. In another example, in some embodiments, the mechanisms can cluster pixels within a particular area of a frame by grouping or connecting pixels within a particular color range and can identify the color range or color ranges having a number of pixels greater than a particular threshold number of pixels. The mechanisms can then determine that a main color or a dominant color for the frame is the average color, or other suitable combination, of the pixels within the cluster having the most pixels or the average color, or other suitable combination, of the pixels within the cluster or clusters having a number of pixels greater than a threshold number of pixels.

In some embodiments, the mechanisms described herein can allow a user device on which the video content item is presented to modify colors of one or more user interface elements of a user interface for presenting a video content item with less processing by using a server to analyze colors of the multiple frames included in the video content and using the server to transmit the determined color palette information to the user device (e.g., as opposed to the processing time and processing power required by the user device to analyze the colors associated with multiple frames of the video content either prior to the video content being presented or while the video content is being presented). Additionally, by transmitting the color palette information in a computer-readable format, such as an array of color values where each color value corresponds to a different segment of the video content item, the color palette information can be transmitted without requiring much additional bandwidth relative to the bandwidth required to transmit the video content item to the user device. Furthermore, by transmitting the color palette information as an array of color values, individual user devices that receive the color palette information can modify the color palette information in any suitable manner in modifying colors of the user interface elements. For example, in some embodiments, an individual user device that receives a video content item from a server with corresponding color palette information can modify transparency values associated with a color for a particular user interface element, and/or modify color values in any other suitable manner. In another example, in some embodiments, a user device executing a first media playback application to present a video content item can apply the color palette information differently than the same user device executing a second media playback application to present the video content item. In yet another example, in some embodiments, a user device can apply the color palette information on a user interface presented by a media playback application differently than a different user device executing the same media playback application to present the user interface for presenting the video content item (e.g., based on device capabilities, based on device preferences, etc.).

Turning to FIG. 1, an example 100 of a process for color palette extraction for frames of a video is shown in accordance with some embodiments of the disclosed subject matter. In some embodiments, the blocks of process 100 can be implemented on one or more servers, as shown in and described below in connection with FIG. 3.

In some embodiments, process 100 can begin at 102 by identifying a frame of a video for analysis. In some embodiments, the frame can be chosen in any suitable manner. For example, in some embodiments, the frame can be a first frame of the video. As another example, in some embodiments, process 100 can loop through the video and identify every Nth frame for analysis (e.g., every tenth frame, every twentieth frame, and/or any other suitable frame). As a more particular example, in some embodiments, process 100 can identify one frame that represents any suitable duration of time of the video (e.g., one second, two seconds, and/or any other suitable duration). As a specific example, if the frame rate of the video is 10 frames per second, and process 100 identifies one frame that represents a second of the video, process 100 can identify every tenth frame for analysis. As yet another more particular example, in some embodiments, process 100 can identify particular frames for analysis based on any suitable criterion, such as a particular object appearing in the frame, a particular color threshold being met (e.g., such that initial frames including black space is not submitted for analysis), etc.

At 104, in some embodiments, process 100 can cluster pixels of the identified frame based on colors of the pixels. In some embodiments, process 100 can cluster the pixels into any suitable number of pixels (e.g., two, five, ten, and/or any other suitable number). Note that, in some embodiments, the number of clusters can vary based on any suitable information (e.g., a genre of the video content item, a resolution of the video content item, and/or any other suitable information). In some embodiments, process 100 can cluster the pixels of the identified frame using any suitable technique or combination of techniques. For example, in some embodiments, process 100 can cluster the pixels using a group of thresholds corresponding to each cluster. As a more particular example, in some embodiments, process 100 can compare color values (e.g., hue, saturation, and value, or HSV, parameters for each pixel, and/or any other suitable values) for a pixel to predetermined threshold values corresponding to different clusters to assign the pixel to a particular cluster. As a specific example, process 100 can determine that a pixel with a hue parameter between 26 degrees and 52 degrees, a saturation parameter between 0.3 and 0.6, and a value parameter between 0.3 and 0.6 is to be assigned to a first cluster. As another example, in some embodiments, process 100 can use any suitable statistical classification or machine learning technique(s) to cluster the pixels (e.g., a K-nearest neighbors algorithm, and/or any other suitable clustering algorithm).

At 106, in some embodiments, upon obtaining multiple clusters of pixels, process 100 can calculate an average color for each cluster in the top N clusters with the most pixels assigned to the cluster. For example, in an instance where process 100 clustered the pixels into ten clusters at block 104, N can be any suitable number less than ten (e.g., three, five, and/or any other suitable number). In some embodiments, process 100 can rank the clusters from block 104 based on the number of pixels assigned to each cluster and can calculate the average color for each of the N top-ranked clusters. In some embodiments, process 100 can calculate an average color using any suitable technique or combination of techniques. For example, in some embodiments, process 100 can average individual components of a color indicating scale (e.g., Red, Green, Blue, or RGB, values, HSV values, and/or any other suitable color indicating scale). As a more particular example, in instances where colors are indicated using RGB values, process 100 can calculate the average color by averaging all R values for all of the pixels in the cluster, all G values for all of the pixels in the cluster, and all B values for all of the pixels in the cluster, and calculating the average color as the color resulting from the average R, G, and B values. In some embodiments, process 100 can discard one or more pixels that have been determined as likely to be outliers, for example, black pixels in the middle of a group of non-black pixels, pixels that are similar to particular colors (e.g., similar to and/or close to white, similar to and/or close to black, similar to and/or close to particular shades of yellow, and/or any other suitable colors or shades), and/or any other suitable pixels. For example, in instances where process 100 discards pixels that are white or similar to white and where color information is stored in an RGB format, process 100 can discard any pixels with color information within a predetermined range of (255, 255, 255), such as pixels with red, green, and blue pixels above a predetermined threshold (e.g., above 250, above 220, and/or any other suitable threshold).

In some embodiments, process 100 can select a cluster of the top N clusters with an average color most similar to content color data of a previously analyzed frame at 108. For example, in some embodiments, process 100 can select the cluster with an average color most similar to a main color or dominant color of a frame corresponding to the previous second of video. As another example, in some embodiments, process 100 can select the cluster with an average color most similar to a main color or dominant color of the frame that was previously analyzed by process 100. In some embodiments, process 100 can determine similarity of the average colors of the clusters to the content color data of the previous frame using any suitable technique or combination of techniques. For example, in some embodiments, process 100 can calculate a distance metric (e.g., a Euclidean distance, and/or any other suitable distance metric) of the average color of each of the top N clusters and the main color of the previous frame, and can select the cluster with the lowest distance metric.

Note that, in some embodiments, process 100 can select the main color or the dominant color of the first frame using any suitable technique or combination of techniques. For example, in some embodiments, process 100 can use the clustering technique described above in connection with block 104 to partition the first frame into any suitable number of clusters, and can use the technique described above in connection with block 106 to determine an average color of each of the clusters. In some embodiments, process 100 can select the main color or the dominant color of the first frame based on the average colors of the clusters. For example, in some embodiments, process 100 can select the main color or the dominant color to be the average color of the largest cluster. As another example, in some embodiments, process 100 can select the main color or the dominant color to be the average color of a cluster in a particular spatial region of the first frame. As a more particular example, in some embodiments, process 100 can select the main color or the dominant color to be the average color of a cluster that spans at least a center portion of the first frame.

At 110, in some embodiments, process 100 can determine whether the selected average color determined to be most similar to content color data of a previously analyzed frame is to be selected as the content color data of the frame currently being analyzed. For example, in some embodiments, process 100 can determine whether the selected average color is to be selected as a main color or dominant color of the frame currently being analyzed. Process 100 can determine whether to select the average color based on any suitable information. For example, in some embodiments, process 100 can determine that the average color is not to be selected as the content color data of the current frame if the average color is similar to a particular color and/or shade of color (e.g., a particular shade of yellow, a particular shade of orange, and/or any other suitable particular color or shade). In some embodiments, process 100 can determine that the average color is to be selected as content color data of the current frame as a default. Note that, in some embodiments, process 100 can make any suitable modifications to the average color prior to setting the average color to be the content color data of the current frame. For example, in some embodiments, process 100 can adjust a saturation and/or a brightness level of the average color. As a more particular example, in instances where the average color is yellow and/or similar to yellow, process 100 can adjust a saturation and/or a brightness value associated with the average color to be below a predetermined threshold (e.g., below 65, below 80, and/or any other suitable value). In some embodiments, process 100 can preferentially select particular colors as the content color data based on any suitable information. For example, in some embodiments, process 100 can determine that the selected content color data is to be a shade of a particular color (e.g., red, blue, green, and/or any other suitable color) based on an entity associated with a service providing the video content item. As a more particular example, process 100 can determine that the selected content color data is to be a shade of red if a first video sharing service or social networking service is providing the video content item and can determine that the selected content color data is to be a shade of blue if a second video sharing service or social networking service is providing the video content item.

Note that, in some embodiments, process 100 can process multiple frames of the video at once. For example, in some embodiments, process 100 can generate a mosaic representing multiple frames of the video spanning any suitable duration of the video, as shown by mosaic 200 of FIG. 2A. For example, in some embodiments, mosaic 200 can have any suitable number of frames (e.g., 4, 9, 25, and/or any other suitable number), where each frame represents any suitable portion of the video. As a more particular example, in some embodiments, each frame can be one second apart in the video, and a mosaic with, for example, 25 frames can therefore represent 25 seconds of the video. In some embodiments, process 100 can then identify content color data (e.g., a main color and/or a dominant color of the frame, and/or any other suitable content color data) for each frame in mosaic 100, using the techniques described above in connection with blocks 104-110. Process 100 can then generate a color mosaic 250 that indicates content color data for each frame in mosaic 100. Color mosaic 250 can therefore indicate, in one mosaic, content color data for multiple frames of the video.

It should be noted that mosaic 200 of FIG. 2A is merely illustrative and can be presented in any suitable manner. For example, each analyzed frame can be presented in a mosaic configuration and, within each presented frame, the determined color content data can be concurrently presented in a portion of the frame (e.g., an area containing the determined color content data in a corner of the frame). In another example, each analyzed frame can be blended, or otherwise combined, with the determined color content data in the presented mosaic.

If, at block 110, process 100 determines that the average color is not to be selected as the content color data of the current frame (“no” at 110), process 100 can loop back to 108 and select an average color of a different cluster. For example, process 100 can determine that a cluster having the next highest number of pixels is used to determine the color content data.

If, at block 110, process 100 determines that the average color is to be selected as the content color data of the current frame (“yes” at 110), process 100 can proceed to block 112 and can determine a color palette for the current frame based on the content color data. In some embodiments, the color palette can specify colors for any suitable portions of a user interface in which the video content item is being presented. For example, as shown in user interface 500 of FIG. 5, the color palette can specify colors for different sections 504 of user interface 500 in which video 502 is being presented. As a more particular example, individual sections within sections 504 can each indicate different types of information, such as information about the video (e.g., a name of the creator, a number of views of the video, and/or any other suitable information), comments from viewers of the video, recommended videos determined to be similar to the video, and/or any other suitable information. In some embodiments, each section of sections 504 can be presented in a text and with a background color based on the content color data selected at block 110. Additionally, in some embodiments, the color palette can indicate colors of any suitable user interface navigation components, as indicated in navigation panel 506, such as video player controls (e.g., a pause control, a rewind control, a video frame navigation control, and/or any other suitable controls). In some embodiments, the color palette can assign colors to particular user interface elements based on a state of the user interface. For example, the color of a user interface control when the user interface is determined to be in an active state (e.g., when a cursor is hovered in a particular position within user interface 500, when a particular portion of user interface 500 is active, and/or based on any other suitable determination) can be different than the color of the user interface control when the user interface is determined to be in an idle state, as shown in user interface 500.

In some embodiments, process 100 can determine the colors of the color palette in any suitable format, as shown in schematic 550. For example, in some embodiments, process 100 can identify a color for each type of content in content types 552. In some embodiments, the colors in the color palette can be in any suitable format, for example, in hexadecimal values as shown in FIG. 5, and/or in any other suitable format (e.g., HSV values, RGB values, and/or any other suitable format). Additionally, as shown in FIG. 5, the color palette information can include any other suitable information, such as contrast ratios between different types of content. For example, as shown in FIG. 5, the color palette information can indicate a contrast ratio for a user interface icon in an idle state with text in a particular section of user interface 500.

Note that, in some embodiments, process 100 can determine multiple color palettes, for example, corresponding to different themes based on the content color data. For example, in some embodiments, process 100 can determine a first color palette corresponding to a dark theme, a second color palette corresponding to a light theme, and a third color palette corresponding to a vibrant theme. As a more particular example, in some embodiments, the first color palette corresponding to the dark theme can contain background colors that are generally darker than corresponding colors in the light theme and the vibrant theme. In some embodiments, a first color palette (e.g., corresponding to a dark theme) can be generated, and other color palettes corresponding to other themes (e.g., light, vibrant, and/or any other suitable themes) can be generated based on the first color palette, for example, by decreasing color values by a predetermined percentage (e.g., by 5%, by 10%, and/or by any other suitable percentage) for each theme. In some embodiments, each of the first color palette, second color palette, and third color palette can have different contrast ratios based on background text colors and text colors for each color palette. For example, in some embodiments, the contrast ratios can be set such that text when appearing superimposed on a background of a particular color is generally readable (e.g., complies with any suitable accessibility standards, and/or meets any other suitable standards).

In some embodiments, process 100 can loop through the video and determine color palette information for multiple frames or groups of frames (e.g., using mosaics as described above in connection with FIG. 2) for the video content item. For example, in instances where process 100 analyzes every ten frames of the video, process 100 can process the subsequent frame in the sequence of frames. In some embodiments, process 100 can store color palette information for each frame or portion of the video in an array, and can store the aggregated color palette information in any suitable format, such as an array of arrays.

Process 100 can convert the color palette to any suitable format at block 114. For example, in some embodiments, process 500 can convert individual color palette values (e.g., a color of a background of a particular section, a color of a navigation control in an active state, and/or any other suitable color palette values) in a hexadecimal format to equivalent values in an RGB format. In some embodiments, process 100 can use any suitable equations and or transformation algorithms to convert the color palette values to the suitable format. In some embodiments, process 100 can select the format type based on a type of user device which the color palette values are to be sent to. Additionally or alternatively, in some embodiments, process 100 can convert the color palette values to multiple formats to be suitable for multiple device types. In some embodiments, process 100 can indicate any other suitable information in the color palette for each frame, such as an opacity/transparency associated with each item in the color palette. In some embodiments, the opacity/transparency value can have a default value (e.g., 1 indicating a fully opaque element, and/or any other suitable value), and the value can be modified by a user device when rendering the video content item in accordance with the color palette.

Note that, in some embodiments, process 100 can convert the color palette to a particular format based on a platform of a user device that is to receive the color palette information. For example, in some embodiments, process 100 can convert the color palette to a first format for user devices that are desktop computers, a second format for user devices that are mobile devices, and a third format for user devices that are smart televisions.

In some embodiments, process 100 can transmit the color palette to a user device at 116. In some embodiments, the user device can be any suitable user device, such as a user device that requested the video content item from a server hosting the video. In some embodiments, process 100 can transmit multiple color palettes, where each color palette corresponds to a subset of frames (e.g., one frame of the video, one second of the video, and/or any other suitable subset) of the video. Additionally, in some embodiments, multiple color palettes, each corresponding to a different theme (e.g., dark, light, vibrant, and/or any other suitable theme) can be transmitted to the user device.

In some embodiments, the user device can then use the received color palette(s) to dynamically modify a user interface in which the video is being presented during presentation of the video. For example, in some embodiments, colors of video player controls can be modified to correspond to an average color of a particular subset of the video during presentation of that subset of the video. As another example, in some embodiments, colors of sections of the user interface can be modified to correspond to an average color of a particular subset of the video during presentation of that subset of the video.

Turning to FIG. 3, an example of an illustrative system 300 suitable for implementation of mechanisms described herein for color palette extraction for videos is shown in accordance with some embodiments of the disclosed subject matter is shown. As illustrated, hardware 300 can include one or more servers, such as a server 302, a communication network 304, and/or one or more user devices 306, such as user devices 308 and 310.

In some embodiments, server(s) 302 can be any suitable server(s) for identifying color palettes for different frames of a video. For example, in some embodiments, server(s) 302 can identify content color data such as a main color or a dominant color for a particular frame of the video and can identify a palette of colors that can be used in different user interface contexts when presenting the video based on the content color data, as shown in and described above in connection with FIG. 1. Note that, in some embodiments, the content color data of the frame can be calculated on a first server (e.g., an image server) and the color palette can be determined on a second server (e.g., a server that hosts and/or transmits the video to user devices 306). In some embodiments, server(s) 302 can identify the content color data in any suitable manner, for example, by clustering pixels of the frame based on color and identifying an average color for each pixel, as described above in connection with FIG. 3.

Communication network 304 can be any suitable combination of one or more wired and/or wireless networks in some embodiments. For example, communication network 306 can include any one or more of the Internet, an intranet, a wide-area network (WAN), a local-area network (LAN), a wireless network, a digital subscriber line (DSL) network, a frame relay network, an asynchronous transfer mode (ATM) network, a virtual private network (VPN), and/or any other suitable communication network. User devices 306 can be connected by one or more communications links 312 to communication network 304 that can be linked via one or more communications links (e.g., communications link 314) to server(s) 302. Communications links 312 and/or 314 can be any communications links suitable for communicating data among user devices 306 and server(s) 302 such as network links, dial-up links, wireless links, hard-wired links, any other suitable communications links, or any suitable combination of such links.

In some embodiments, user devices 306 can include one or more computing devices suitable for presenting a video content item, receiving color palette information from server(s) 302, presenting user interface controls in accordance with the received color palette information, and/or any other suitable functions. For example, in some embodiments, user devices 306 can be implemented as a mobile device, such as a smartphone, mobile phone, a tablet computer, a laptop computer, a vehicle (e.g., a car, a boat, an airplane, or any other suitable vehicle) entertainment system, a portable media player, and/or any other suitable mobile device. As another example, in some embodiments, user devices 306 can be implemented as a non-mobile device such as a desktop computer, a set-top box, a television, a streaming media player, a game console, and/or any other suitable non-mobile device.

Although server 302 is illustrated as a single device, the functions performed by server 302 can be performed using any suitable number of devices in some embodiments. For example, in some embodiments, the functions performed by server 302 can be performed on a single server. As another example, in some embodiments, multiple devices can be used to implement the functions performed by server 302.

Although two user devices 308 and 310 are shown in FIG. 3, any suitable number of user devices, and/or any suitable types of user devices, can be used in some embodiments.

Server(s) 302 and user devices 306 can be implemented using any suitable hardware in some embodiments. For example, in some embodiments, devices 302 and 306 can be implemented using any suitable general purpose computer or special purpose computer. For example, a server may be implemented using a special purpose computer. Any such general purpose computer or special purpose computer can include any suitable hardware. For example, as illustrated in example hardware 400 of FIG. 4, such hardware can include hardware processor 402, memory and/or storage 404, an input device controller 406, an input device 408, display/audio drivers 410, display and audio output circuitry 412, message interface(s) 414, an antenna 416, and a bus 418.

Hardware processor 402 can include any suitable hardware processor, such as a microprocessor, a micro-controller, digital signal processor(s), dedicated logic, and/or any other suitable circuitry for controlling the functioning of a general purpose computer or a special purpose computer in some embodiments. In some embodiments, hardware processor 402 can be controlled by a server program stored in memory and/or storage 404 of a server (e.g., such as server 302). For example, the server program can cause hardware processor 402 to analyze frames of a video to determine a main color for each frame, determine a color palette for each frame based on the main color of the frame, transmit the color palette information to user devices in connection with transmission of the video, and/or perform any other suitable actions. In some embodiments, hardware processor 402 can be controlled by a computer program stored in memory and/or storage 404 of user device 306. For example, the computer program can cause hardware processor 402 to present a video content item, present user interface controls in connection with the received video content item in accordance with received color palette information, and/or perform any other suitable actions.

Memory and/or storage 404 can be any suitable memory and/or storage for storing programs, data, media content, advertisements, and/or any other suitable information in some embodiments. For example, memory and/or storage 404 can include random access memory, read-only memory, flash memory, hard disk storage, optical media, and/or any other suitable memory.

Input device controller 406 can be any suitable circuitry for controlling and receiving input from one or more input devices 408 in some embodiments. For example, input device controller 406 can be circuitry for receiving input from a touchscreen, from a keyboard, from a mouse, from one or more buttons, from a voice recognition circuit, from a microphone, from a camera, from an optical sensor, from an accelerometer, from a temperature sensor, from a near field sensor, and/or any other type of input device. In another example, input device controller 406 can be circuitry for receiving input from a head-mountable device (e.g., for presenting virtual reality content or augmented reality content).

Display/audio drivers 410 can be any suitable circuitry for controlling and driving output to one or more display/audio output devices 412 in some embodiments. For example, display/audio drivers 410 can be circuitry for driving a touchscreen, a flat-panel display, a cathode ray tube display, a projector, a speaker or speakers, and/or any other suitable display and/or presentation devices.

Communication interface(s) 414 can be any suitable circuitry for interfacing with one or more communication networks, such as network 304 as shown in FIG. 3. For example, interface(s) 414 can include network interface card circuitry, wireless communication circuitry, and/or any other suitable type of communication network circuitry.

Antenna 416 can be any suitable one or more antennas for wirelessly communicating with a communication network (e.g., communication network 304) in some embodiments. In some embodiments, antenna 416 can be omitted.

Bus 418 can be any suitable mechanism for communicating between two or more components 402, 404, 406, 410, and 414 in some embodiments.

Any other suitable components can be included in hardware 400 in accordance with some embodiments.

In some embodiments, at least some of the above described blocks of the process of FIG. 1 can be executed or performed in any order or sequence not limited to the order and sequence shown in and described in connection with the figure. Also, some of the above blocks of FIG. 1 can be executed or performed substantially simultaneously where appropriate or in parallel to reduce latency and processing times. Additionally or alternatively, some of the above described blocks of the process of FIG. 1 can be omitted.

In some embodiments, any suitable computer readable media can be used for storing instructions for performing the functions and/or processes described herein. For example, in some embodiments, computer readable media can be transitory or non-transitory. For example, non-transitory computer readable media can include media such as non-transitory forms of magnetic media (such as hard disks, floppy disks, etc.), non-transitory forms of optical media (such as compact discs, digital video discs, Blu-ray discs, etc.), non-transitory forms of semiconductor media (such as flash memory, electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), etc.), any suitable media that is not fleeting or devoid of any semblance of permanence during transmission, and/or any suitable tangible media. As another example, transitory computer readable media can include signals on networks, in wires, conductors, optical fibers, circuits, any suitable media that is fleeting and devoid of any semblance of permanence during transmission, and/or any suitable intangible media.

In situations in which the systems described herein collect personal information about users, or make use of personal information, the users may be provided with an opportunity to control whether programs or features collect user information (e.g., information about a user's social network, social actions or activities, profession, a user's preferences, or a user's current location). In addition, certain data may be treated in one or more ways before it is stored or used, so that personal information is removed. For example, a user's identity may be treated so that no personal information can be determined for the user, or a user's geographic location may be generalized where location information is obtained (such as to a city, ZIP code, or state level), so that a particular location of a user cannot be determined. Thus, the user may have control over how information is collected about the user and used by a content server.

Accordingly, methods, systems, and media for color palette extraction for videos are provided.

Although the invention has been described and illustrated in the foregoing illustrative embodiments, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the details of implementation of the invention can be made without departing from the spirit and scope of the invention, which is limited only by the claims that follow. Features of the disclosed embodiments can be combined and rearranged in various ways. 

What is claimed is:
 1. A method for color palette extraction for videos, the method comprising: identifying, using a server that includes a hardware processor, a plurality of frames of a video content item, wherein the plurality of frames include a first frame and a second frame; determining a main color corresponding to the first frame; clustering a plurality of pixels of the second frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of at least a portion of clusters from the group of clusters, determining an average color for the cluster; determining whether to set the average color for a cluster of the portion of clusters as a main color corresponding to the second frame based on the main color corresponding to the first frame, wherein the first frame precedes the second frame in the video content item; determining a color palette for each of the first and second frames in the plurality of frames based on the main color of each frame, wherein the color palette for each of the first and second frames includes a plurality of colors corresponding to the main color corresponding to the frame and wherein the color palette assigns one of the plurality of colors to a portion of an interface in which the video content item is to be presented; and transmitting information indicating the color palette to a user device in response to a request to present the video content item.
 2. The method of claim 1, further comprising converting the information indicating the color palette from a first format to a second format prior to transmitting the information indicating the color palette to the user device.
 3. The method of claim 1, further comprising selecting the cluster from the at least a portion of clusters based on a similarity of the average color of the cluster to the first frame of the video content item.
 4. The method of claim 1, further comprising determining a plurality of color palettes each corresponding to a theme of a plurality of themes to be applied to a user interface in which the video content item is to be presented.
 5. The method of claim 1, wherein each color palette includes a first color corresponding to user interface controls in an active state and a second color corresponding to user interface controls in an idle state.
 6. The method of claim 1, further comprising: generating a first mosaic that includes each of the plurality of frames of the video content item; determining a main color for each frame in the plurality of frames in the mosaic; and generating a second mosaic that includes the main color of each of the plurality of frames of the video content item, wherein a color palette for each frame in the plurality of frames in the first mosaic is based on the determined main color of each frame in the plurality of frames in the second mosaic.
 7. The method of claim 1, further comprising determining that the average color for the cluster of the portion of clusters corresponds to a particular color and, in response to determining that the average color corresponds to the particular color, adjusting at least one of a saturation level and a brightness level associated with the average color.
 8. A system for color palette extraction for videos, the system comprising: a hardware processor that is programmed to: identify a plurality of frames of a video content item, wherein the plurality of frames include a first frame and a second frame; determine a main color corresponding to the first frame; cluster a plurality of pixels of the second frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of at least a portion of clusters from the group of clusters, determine an average color for the cluster; determine whether to set the average color for a cluster of the portion of clusters as a main color corresponding to the second frame based on the main color corresponding to the first frame, wherein the first frame precedes the second frame in the video content item; determine a color palette for each of the first and second frames in the plurality of frames based on the main color of each frame, wherein the color palette for each of the first and second frames includes a plurality of colors corresponding to the main color corresponding to the frame and wherein the color palette assigns one of the plurality of colors to a portion of an interface in which the video content item is to be presented; and transmit information indicating the color palette to a user device in response to a request to present the video content item.
 9. The system of claim 8, wherein the hardware processor is further programmed to convert the information indicating the color palette from a first format to a second format prior to transmitting the information indicating the color palette to the user device.
 10. The system of claim 8, wherein the hardware processor is further programmed to select the cluster from the at least a portion of clusters based on a similarity of the average color of the cluster to the first frame of the video content item.
 11. The system of claim 8, wherein that hardware processor is further programmed to determine a plurality of color palettes each corresponding to a theme of a plurality of themes to be applied to a user interface in which the video content item is to be presented.
 12. The system of claim 8, wherein each color palette includes a first color corresponding to user interface controls in an active state and a second color corresponding to user interface controls in an idle state.
 13. The system of claim 8, wherein that hardware processor is further programmed to: generate a first mosaic that includes each of the plurality of frames of the video content item; determine a main color for each frame in the plurality of frames in the mosaic; and generating a second mosaic that includes the main color of each of the plurality of frames of the video content item, wherein a color palette for each frame in the plurality of frames in the first mosaic is based on the determined main color of each frame in the plurality of frames in the second mosaic.
 14. The system of claim 8, wherein the hardware processor is further programmed to determine that the average color for the cluster of the portion of clusters corresponds to a particular color and, in response to determining that the average color corresponds to the particular color, adjusting at least one of a saturation level and a brightness level associated with the average color.
 15. A non-transitory computer-readable medium containing computer executable instructions that, when executed by a processor, cause the processor to perform a method for method for color palette extraction for videos, the method comprising: identifying, at a server, a plurality of frames of a video content item, wherein the plurality of frames include a first frame and a second frame; determining a main color corresponding to the first frame; clustering a plurality of pixels of the second frame of the video content item based on a color of each of the pixels in the plurality of pixels into a group of clusters; for each of at least a portion of clusters from the group of clusters, determining an average color for the cluster; determining whether to set the average color for a cluster of the portion of clusters as a main color corresponding to the second frame based on the main color corresponding to the first frame, wherein the first frame precedes the second frame in the video content item; determining a color palette for each of the first and second frames in the plurality of frames based on the main color of each frame, wherein the color palette for each of the first and second frames includes a plurality of colors corresponding to the main color corresponding to the frame and wherein the color palette assigns one of the plurality of colors to a portion of an interface in which the video content item is to be presented; and transmitting information indicating the color palette to a user device in response to a request to present the video content item.
 16. The non-transitory computer-readable medium of claim 15, wherein the method further comprises converting the information indicating the color palette from a first format to a second format prior to transmitting the information indicating the color palette to the user device.
 17. The non-transitory computer-readable medium of claim 15, wherein the method further comprises selecting the cluster from the at least a portion of clusters based on a similarity of the average color of the cluster to the first frame of the video content item.
 18. The non-transitory computer-readable medium of claim 15, wherein the method further comprises determining a plurality of color palettes each corresponding to a theme of a plurality of themes to be applied to a user interface in which the video content item is to be presented.
 19. The non-transitory computer-readable medium of claim 15, wherein each color palette includes a first color corresponding to user interface controls in an active state and a second color corresponding to user interface controls in an idle state.
 20. The non-transitory computer-readable medium of claim 15, wherein the method further comprises: generating a first mosaic that includes each of the plurality of frames of the video content item; determining a main color for each frame in the plurality of frames in the mosaic; and generating a second mosaic that includes the main color of each of the plurality of frames of the video content item, wherein a color palette for each frame in the plurality of frames in the first mosaic is based on the determined main color of each frame in the plurality of frames in the second mosaic.
 21. The non-transitory computer-readable medium of claim 15, wherein the method further comprises determining that the average color for the cluster of the portion of clusters corresponds to a particular color and, in response to determining that the average color corresponds to the particular color, adjusting at least one of a saturation level and a brightness level associated with the average color. 